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BACKGROUND OF THE INVENTION 

Field of the Invention 

[0003] The present invention relates to matched filters and correlators and, more 

particularly, to a novel fast correlator transform ("FCT") and to methods and 
systems for implementing same. 

Related Art 

[0004] Matched Filter Theory was introduced by D.O. North in 1943. Others 

such as Van Vleck, Middleton, Weiner, and Turin, have added to the body of 
knowledge and application, ranging from RADAR to SONAR and 
communications. Although the basic theory was originally applied to continuous 



time systems and linear networks, the theory has been augmented to absorb 

discrete sampled filters and correlator operations. Indeed Paul Green pointed out 

in the June 1960 IRE Transactions on Information Theory, Matched Filter Issue: 

"Correlation detection was studied at first as a separate subject, but 
the equivalence of the two operations (meaning matched filters 
and correlators) was soon appreciated." 

[0005] IRE Transactions on Information Theory, New York, NY: Professional 

Group on Information, Institute of Radio Engineers, June, 1960, incorporated 
herein by reference in its entirety. 

[0006] More recently Van Whalerf and Blahut as well as others have provided 

proofs of mathematical equivalence of correlation or matched filtering based on a 
change of variables formulation. 

[0007] With the surge of DSP and proliferation of VLSI CMOS circuits as well as 

the universal push for software radios, many discrete matched filter realizations 
have evolved. The most notable digital implementation is the Transversal or 
Finite Impulse Response Filter which may realize the time flipped impulse 
response of the waveform to be processed or utilized as a correlator, both which 
produce the equivalent result at the optimum observation instant. 

[0008] A particular concern arises when multiple filtering operations are required, 

concurrently, as is the case for parallel correlators. The complexity, clock speeds 
and signal flow control typically increase cost, size, and power. 

[0009] What are needed are improved methods and systems for performing 

matched filtering and/or correlating functions, including concurrent and/or 
parallel correlations. 
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[0010] The present invention is directed to methods and systems for performing 

matched filtering and/or correlating functions, including concurrent and/or 
parallel correlations. More particularly, the present invention is directed to a fast 
correlator transform (FCT) algorithm and methods and systems for implementing 
same. The invention is useful for, among other things, correlating an encoded 
data word (Xo-Xm-i) with encoding coefficients (Co-Cm-i), wherein each of (Xo- 
Xm-i) is represented by one or more bits and each coefficient is represented by 
one or more bits, wherein each coefficient has k possible states, and wherein M is 
greater than 1. 

[0011] In accordance with the invention, X 0 is multiplied by each state (Co(o> 

through Co(k-i)) of the coefficient Co, thereby generating results X 0 C 0 (o) through 
XoCo(k-i). This is repeated for data bits (Xi-X M -i) and corresponding coefficients 
(Ci-Cm-i)> respectively. The results are grouped into N groups. Combinations 
within each of said N groups are added to one another, thereby generating a first 
layer of correlation results. 

[0012] The first layer of results is grouped and the members of each group are 

summed with one another to generate a second layer of results. This process is 
repeated as necessary until a final layer of results is generated. The final layer of 
results includes a separate correlation output for each possible state of the 
complete set of coefficients (Co-Cm-i). The results in the final layer are compared 
with one another to identify the most likely encoded data word. 

[0013] In an embodiment, the summations are pruned to exclude summations that 

would result in invalid combinations of the encoding coefficients (Co-Cm-i). In an 
embodiment, substantially the same hardware is utilized for processing in-phase 
and quadrature phase components of the data word (Xo-X M i). In an embodiment, 
the coefficients (Cq-Cm-i) represent real numbers. In an alternative embodiment, 



the coefficients (Co-C M -i) represent complex numbers. In an embodiment, the 
coefficients (Co-Cm-i) are represented with a single bit. Alternatively, the 
coefficients (Co-Cm-i) are represented with multiple bits (e.g., magnitude). In an 
embodiment, the coefficients (Co-C M -t) represent a cyclic code keying ("CCK") 
code set substantially in accordance with EEEE 802.1 1 WLAN standard. 
[0014] Further features and advantages of the invention, as well as the structure 

and operation of various embodiments of the invention, are described in detail 
below with reference to the accompanying drawings. It is noted that the invention 
is not limited to the specific embodiments described herein. Such embodiments 
are presented herein for illustrative purposes only. Additional embodiments will 
be apparent to persons skilled in the relevant art(s) based on the teachings 
contained herein. 

BRIEF DESCRIPTION OF THE DRAWINGS/FIGURES 

[0015] The present invention will be described with reference to the 

accompanying drawings. The drawing in which an element first appears is 
typically indicated by the leftmost digit(s) in the corresponding reference number. 

[0016] FIG. 1 is a block diagram of an example discrete transversal matched filter 

or correlator, in which the present invention can be implemented. 

[0017] FIG. 2 A is an expanded view of a summation function 102 illustrated in 

FIG. 1. 

[0018] FIG. 2B is a block diagram of a FCT kernel implemented in an 802.11 

demodulator 2 1 0 in accordance with an aspect of the present invention. 

[0019] FIG. 3 A illustrates example correlation kernels, in accordance with an 

aspect of the present invention. 

[0020] FIG. 3B illustrates example first and second layers of an FCT processing 

hierarchy in accordance with an aspect of the invention. 



[0021] FIG. 3C illustrates an example signal flow diagram for the first and second 

layers illustrated in FIG. 3B. 
[0022] FIG. 4A illustrates a conventional parallel correlator approach. 

[0023] FIG. 4B illustrates an example matrix form of coefficients for a parallel 

correlator in accordance with an aspect of the present invention. 
[0024] FIG. 5 illustrates an example complex fast Hadamard Transform. 

[0025] FIG. 6 illustrates an example parallel magnitude compare operation in 

accordance with an aspect of the invention. 
[0026] FIG. 7 illustrates an example final layer of an FCT processing hierarchy in 

accordance with an aspect of the invention. 
[0027] FIG. 8 illustrates an example process flowchart for implementing an FCT 

algorithm in accordance with an aspect of the present invention. 
[0028] FIG. 9 is a signal path diagram for an example CCK decoder output trellis, 

including an I signal path 902 and a Q signal path 904. 
[0029] FIG. 10 is a block diagram of an inverted channel detection circuit 1000 

that can be used to determine if one of the channels was inverted. 
[0030] FIG. 1 1 is a block diagram of a complex correlation in accordance with 

equation 12. 

[0031] FIG. 12 is a block diagram of an example FCT 1200 in accordance with 

the present invention. 

[0032] FIG. 13 is a block diagram of another example FCT 1300 in accordance 

with the present invention, including stages la, lb, 2a, 2b, and 3. 

[0033] FIG. 14 is an example block diagram of stage la of the FCT 1300. 

[0034] FIG. 1 5 is an example block diagram of stage lb of the FCT 1 300. 

[0035] FIG. 16 is an example block diagram of stage 2a of the FCT 1300. 

[0036] FIG. 17 is an example block diagram of stage 2b of the FCT 1300. 

[0037] FIG. 18 is a block diagram of an example system 1800 that calculates An 

values. 
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[0038] FIG. 19 is a block diagram of connection stages for an example simplified 
architecture FCT 1900, including stages la, lb, 2a, 2c, and 3. 

[0039] FIG. 20 is an example block diagram of stage la of the FCT 1900. 

[0040] FIG. 21 is an example block diagram of stage lb of the FCT 1900. 

[0041] FIG. 22 is an example block diagram of stage 2a of the FCT 1900. 

[0042] FIG. 23 is an example block diagram of stage 2b of the FCT 1900. 

[0043] FIGS. 24A and B are an example block diagram of stage 3 of the FCT 
1900. 

[0044] FIG. 25 illustrates an example structure 2500 for implementing equation 
.17. 

[0045] FIG. 26 is another example block diagram of stages of the FCT 1 900. 

[0046] FIG. 27 is another example block diagram of stages of the FCT 1 900. 
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DETAILED DESCRIPTION OF THE INVENTION 

TABLE OF CONTENTS 

I. Introduction 

II. Example Environment: 802.11 

III. Fast Correlator Transform and Correlator Kernels 

IV. Mathematical Formulation 

V. Comparisons to the Hadamard Transform 

VI. Maximum Likelihood Decoding (AWGN, no Multipath) 

A. Magnitude Comparator 

VII. Example Methods for Implementing the FCT Algorithm 

VIII. CCK Chip Code Words 
DC CCK Decoder 

A. Introduction 

X. . CCK Correlator Optimizations 

A. Introduction 

B. Initial Complex Correlation 

C Fast Correlator Transform (FCT) 
D. FCT Based on CCK Code Properties 

XI. Conclusion 

I. Introduction 

[0047] FIG. 1 is a block diagram of an example discrete transversal matched filter 

or correlator 100, also referred to herein as a finite impulse response ("FIR") filter 
100. The FIR filter 100 includes a set of multipliers 104 that multiply data words 
Xi by coefficients C. The FIR filter 100 also includes a final accumulate or 



summation function /102. The matched filter 100 implements the following 
discrete sampling equation; 

n-1 

yi=X CkX <H0 (Eq. i) 

k=0 

[0048] In Eq. 1, Xj are typically derived from a sampling device such as an A/D 

converter and can be represented as signed magnitude, two's complement, or 
other binary representation in parallel bit format. Likewise the multiplier tap 
coefficients C 0 . . . C n can be 1 bit values or k bit (soft) values depending on the 

application. In the case of soft value implementations, the multiplier functions 
can represent considerable hardware or algorithmic complexity. The final 
summation can be implemented using discrete adder technologies or floating 
point processors. The entire architecture can be implemented in hardware, 
software, or a hybrid of each. 

[0049] A particular concern arises when multiple filtering operations are required, 

concurrently, as is the case for parallel correlators. The complexity, clock speeds 
and signal flow control typically increase cost, size, and power. Hence, efficient 
architectures are often pursued to streamline the implementation and thereby 
differentiate product offerings. 

[0050] The present invention is directed to a novel fast correlator transform 

("FCT") algorithm that reduces the number of additions for parallel correlation, as 
compared to conventional techniques. The present invention is also directed to 
methods and systems for implementing the novel FCT algorithm. Conventional 
techniques are typically of less universal application and restricted to certain 
signaling schemes. The present invention, on the other hand, is applicable to a 
variety of parallel matched filter and/or correlator operations. 

[0051] The present invention is as efficient or more efficient than the Fast Walsh 

Transform, currently applied as the "state of the art," and is more universally 
applicable to signaling schemes employing higher order signaling spaces such as 



MQAM, CDMA, etc. In addition, classical multi-path processor algorithms are 
more easily applied using the classical correlator/matched filter kernel, in 
accordance with the present invention, rather than the conventional modified Fast 
Walsh Transform. 

[0052] The present invention can be implemented in hardware, software, 

firmware, and/or combinations thereof For example, and without limitation, the 
invention, or portions thereof, can be implemented in hardware logic. The 
invention, or portions thereof, can also be implemented in software imbedded in a 
computer readable medium (i.e., a computer program product), which, when 
executed on a computer system, causes the computer system to perform in 
accordance with the invention. 

[0053] 

II. Example Environment: 802.11 

[0054] The present invention is described herein as implemented in an example 

environment of an IEEE 802.11b 11 MBPS physical layer signaling scheme. 
IEEE 802.11 is a well-known communications standard and is described in, for 
example, "Medium Access Control (MAC) and Physical (PHY) Specifications," 
ANS/DEE Std 802.11, published by IEEE, (1999Ed.), and incorporated herein by 
reference in its entirety. 

[0055] The present invention is not, however, limited to the IEEE 802.11 

communications standard. Based on the description herein, one skilled in the 
relevant art(s) will understand that the present invention can be implemented for a 
variety of other applications as well. Such other applications are within the spirit 
and scope of the present invention. 

[0056] A distinction is made herein between a basic correlator kernel and a full 

demodulator. A general form of a demodulator for IEEE 802.11 typically 
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requires four correlator banks operating essentially in parallel. A fast correlator 
transform ("FCT") kernal, in accordance with the invention, typically includes 
similar structure, plus complex additions and subtractions from two in-phase and 
two quadrature-phase banks to accomplish the demodulation for IEEE 802.11. 
The Walsh transform, as a comparison, accounts for these additional adds and 
subtracts by having them built into its algorithm. 

[0057] The 802.1 1 signaling scheme of interest is based on Cyclic Code Keying 

("CCK") derived from Walsh/Hadamard functions. A restricted set within the 
available coding space was selected so that the Fast Walsh Transform could be 
utilized to implement an efficient correlator architecture. Originally, both Harris 
and Lucent could not figure out how to apply a classical parallel matched filter or 
correlator, efficiently, to process the waveforms. The current coding space 
provides for 64 code words. Harris erroneously predicted that a classical parallel 
matched filter approach would require 8 x 64 - 512 complex additions since each 
: code word is 8 bits, on I and Q and there are 64 code words. However, the true 
estimate is 7 x 64 = 448 complex additions as illustrated in the example 8-way 
adder tree illustrated in FIG. 2A. 

[0058] FIG. 2 A is an expanded view of the final accumulate or summation 

function 102 in FIG. 1, part of the FIR filter 100. Notice that only 7 adders are 
required for the example implementation. Nevertheless, 448 complex additions 
represent a significant number of operations. Lucent, Harris/Intersil, and Alantro 
apply the Fast Walsh Transform ("FWT") to the CCK code set to reduce the 
correlation operation down to 1 12 complex multiplies due to the restriction placed 
on the code set. 

[0059] The FWT is actually more of a decoder than a correlator. It reduces to the 

performance of a correlator for this specific application if the coefficients in the 
butterfly branches are weighted with simple hard coded values, i.e., 1, -1, j, -j. 
The imaginary numbers are included for the case of complex signaling. 



[0060] The FCT algorithm, according to the present invention, is truly a 

correlation or matched filter operation and can be applied with soft value 
weighting or hard value weighting. Furthermore, the present invention is 
intuitively satisfying, possessing direct correspondence of matched filter tap 
weights or coefficients maintained throughout the hierarchical structure. This 
permits easy extension of the matched filter concept to accommodate channel 
equalization, MLSE, and other adaptive applications. 

[0061 ] FIG. 2B is a block diagram of an example FCT kernel implemented in an 

802.11 demodulator 210. The 802.11 demodulator 210 includes a bank of FCT 
kernels 212. The demodulator 210 can be reduced to 110 complex operations, 
with 78 of the complex operations allocated to the bank of FCT kernels 212. 
Additional simplification can be obtained by exploiting redundancy in the 
algorithm, as described below. 

III. Fast Correlator Transform and Correlator Kernels 

[0062] The available coding space for a 16 bit word is 2 16 = 65536. CCK whittles 

this space down to a code set formed by 8 in-phase ("I") chip and 8 quadrature 
phase ("Q") chip complex symbols. 64 code words are retained, carrying 6 bits of 
information. 2 more bits are contained in the complex QPSK representation, for a 
total of 8 bits of information. 

[0063] Suppose then that 8 samples of an input vector X 0 ,X 1f ...X 7 are 

associated with optimal sampling instants from the output of a chip matched filter. 
Each of these samples would normally be weighted by the coefficients C 0 . . . C 7 
then assimilated as illustrated in FIGS. 1 and 2A. 

[0064] In the 802. 1 1 example, C 0 . . . C 7 correspond to samples of the CCK code 

set symbols. The unknowns X 0 . . . X 7 are noisy input samples from which a 
specific code must be extracted. Conceptually, 64 such complex filtering 
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operations should be performed concurrently since a specific code word cannot be 
predicted a priori. The largest output from the parallel bank of correlators would 
then be selected as the most likely code word correlation match at that particularly 
symbol sample time. 

[0065] In accordance with the present invention, a general philosophy for 

correlation is imposed for partitioned segments of code words. In an 
embodiment, the code word correlations are divided into sub-sets. In the 
. illustrated example embodiment, the code word correlations are divided into 
sample pairs. The present invention is not, however, limited to this example 
embodiment. In other embodiments, the code word correlations are divided into 
triplets, quintuplets, and/or any other suitable sub-sets. 

[0066] Combinations of the code word correlation sub-sets are then provided to 

correlation kernels. FIG. 3A illustrates example correlation kernels 302a-302d, in 
accordance with an aspect of the present invention. 

[0067] The correlation kernels 302a-302d represent all or substantially all 

possible correlation options for the first 2 samples X 0f X 1 . In a similar fashion 

the remaining groupings of double samples (X 2 ,X 3 ), (X 4 ,X 5 ), (X 6 ,X 7 ) will 

also spawn their own set of 4 correlation kernels. 

[0068] The number of separate correlation kernels spawned is determined by the 

number of correlated samples per summation, the number of correlation variations 
possible, and, in an embodiment, the number of invalid combinations of 
correlated samples. In the present example, each coefficient has two possible 
states (i.e., hard value weighting). Thus each subset correlation generates 4 
outputs. In alternative embodiments, soft weighting is implemented, where each 
coefficient is represented with multiple bits (e.g., magnitude representation). 

[0069] In an embodiment, the number of correlation operations associated with 

binary antipodal signaling in accordance with the present invention is 
implemented in accordance with Eq. 2. 
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N k = " ! (Eq.2) 
k r!(n-r) M ' 

[0070] , The result for the example environment described above, using 2 input 
summers, is shown in Eq. 3: 

wherein: 

n is the number of uniquely available summer inputs; 
r is the number of summing inputs per kernel; and 

L is the number of invalid combinations. 

[0071] N k is thus the number of correlation kernels and therefore the number of 

adders or summers required. Groupings of two correlation samples provide for 
convenient binary expansion. As stated above, however, the present invention is 
not limited to groupings of two. 

[0072] The term L represents the number of invalid or disallowed combinations. 

For instance, X 0 C 0 and -X 0 C 0 is an invalid combination when added and 
therefore can be subtracted from the total number of combinations. In an 
embodiment, 3 way adders or summers are utilized. In other embodiments, other 
algorithmic partitioning is utilized. For the current example, partitioning in 
powers of 2 is convenient and attractive in terms of potential hardware 
implementation. 

[0073] FIG. 3B illustrates example first and second layers 304 and 306, 

respectively, of an FCT processing hierarchy in accordance with an aspect of the 
invention. 



- 14- 



[0074] The second layer 306 includes 32 additions 308, of which 308a through 

308p are illustrated, to process combinations of correlations from the first layer 
304 of correlation kernels. The first layer 304 includes 16 additions related to the 
first 4 sample correlations X 0 C 0 ...X 3 C 3 , and 16 additions related to the 2 nd 4 

sample correlations. Hence, the first layer 304 of kernels includes 16 adders 302 
and the second layer possesses 32 adders 308. Once the second layer 306 is 
accomplished, each term that results includes 4 correlation components. 
[0075] Note that 4 unique samples X 0 . . . X 3 at the system input spawns 2 4 unique 

4-tuple correlations at the second layer 306 of kernel processing. The 
corresponding number of additions is calculated for the 4 sample correlation 
sequences from Eq. 4: 

4! 



2!2! 



= 16 (Eq.4) 



[0076] At this point it is important to realize that all that is required is one more 

level of processing to obtain correlation terms consisting of 8 components which 
represents a full length correlation. However, it must also be recognized that there 
are 16 upper 4-tuple correlations as well as 16 lower 4-tuple correlations, which if 
exploited for all combinations in this case would yield 256 addition operations! 
Fortunately the CCK code set is well defined and possesses only 64 valid 8 chip 
component correlations. Hence, the final layer, illustrated in FIG. 7, is pruned to 
perform only 64 unique addition operations. Thus, a total (upper bound) number 
of adders used for the algorithm is: 

16 (first hierarchical layer) + 32 (second layer) + 64 (third layer) = 112 

[0077] This is a remarkable result because a conventional parallel matched filter 

or correlator bank would require 448 complex additions. Theoretically, 1 12 is the 
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upper bound. However, in practice, the Trellis may be pruned to a maximum of 
78 additions on the I and 78 and the Q. 
[0078] FIG. 3C illustrates an example signal flow diagram for the FCT algorithm 

through the first 2 layers 304 and 306. In accordance with the example above, 
there are 8 input samples and 32 output correlation options through the first 2 
layers 304 and 306. Correlation combinations from the upper and lower 16 4- 
tuples provide a final trellis with 64 8-tuple options, each representing a different 
CCK symbol. The option having the highest output is selected during each 
symbol cycle as. the most likely CCK symbol. In the example embodiment, the 
correlation operation utilizes I and Q matched filters since the symbols are 
complex. 

IV. Mathematical Formulation 

[0079] In an embodiment, a receiver in accordance with the present invention 

receives a waveform and an analog-to-digital converter function samples the 
received waveform and converts the waveform down to baseband. The received 
sampled waveform may be represented from Eq. 5: 

+ (Eq.5) 

[0080] Where S,- represents samples from the received signal and N, represent 

noise sampled from an average white Gausian noise ("AWGN") process. This 
equation does not account for multi-path. The samples can be considered as 
complex, with I and Q components. The receiver output can be represented by Eq. 
6: 

Y,=£C k X(i-k) (Eq.6) 

k=0 
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[0081] The coefficients, C k , are considered constant for the nominal case of an 

AWGN channel, "n" is the FIR filter or correlator depth. For a case of m 
correlators operating on Xj in parallel, Eq. 6 becomes Eq. 7: 

Y^EC^XO-k) (Eq.7) 

k=0 

[0082] The mth correlator branch then contains correlator coefficients uniquely 

related to that branch. 

[0083] FIG. 4A illustrates a conventional parallel correlator approach and relates 

it to Eq. 7. The present invention breaks the sum over n-1 into smaller sums, 
typically, although not necessarily, sums of 2. The present invention applies all, 
or substantially all potential cross correlations and carries the 4 results forward to 
a subsequent level of processing. An example mathematical formulation of this 
- operation is provided in Eq. 8; 

k=0 k=2 k=4 k=6 

[0084] Where ^,v,p, and u may be considered as indices to select different 

coefficients. All indices should be considered in terms of a final valid code word 
correlation. In the 802.11 case, 256 correlation sequences are defined by Eq. 8, 
but the options are sifted to correspond only to valid CCK code words. FIG. 4B 
illustrates an example matrix form of coefficients for a parallel correlator 
according to the present invention. 
[0085] The coefficients all take on the values of +/-1 for the examples herein. 

The indices are permitted to contain zeros for consistency with the original FIR 
formulation. The FCT sub-matrices however are simply; 

C M =C k>v =C kiP =C M =[V;] (Eq.9) 
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[0086] The indices ^,v,p,u are manipulated specifically to match the 

coefficients to the desired code words at the Y(i), vpu outputs. The indices also 

uniquely specify the trajectory through the signal flow path. Breaking up the 
original parallel matched filter coefficient matrix into the smaller 2x2 matrix 
permits the FCT algorithm to exploit redundant correlation operations. 

V. Comparisons to the Hadamard Transform 

[0087] An FCT algorithm trellis, in accordance with the present invention, is 

described above. A corresponding length 4 complex fast Hadamard Transform is 
illustrated in FIG. 5. As with the FCT algorithm, there are two such trellis 
structures corresponding to 8 input samples and 32 outputs. The 32 outputs 
include two 16 wide 4-tuple groupings, which are then utilized in combinations to 
produce the final 54 8-tuple correlations. 

[0088] There are distinct differences between the two algorithms. For example, 

the FCT algorithm can function on arbitrary correlation words, even in matched 
filter scenarios, while the FWT requires certain signal coding structure. Also, the 
FCT algorithm permits greater efficiency because the number of adds may be 
tailored for a specific code set or application. 

[0089] Harris and Lucent advertise an efficiency of 112 complex additions for 

CCK demodulation, which amounts to 2x112 additions. The bounding case for 
64 arbitrary correlations with the FCT was shown to be 112, maximum. It turns 
out that the specific case of CCK may be accommodated using a pruned FCT 
algorithm having 78 additions on the in-phase correlation band and 78 additions 
on the quadrature correlation bank. An additional 36 add/subtract operations are 
required to complete the demodulator, for a total of 114 complex operations, 
versus 112 for the FWT. Additional reductions can be obtained to reduce the 
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demodulator to approximately 100 complex add/subtract operations, as described 
below. 

VI. Maximum Likelihood Decoding (AWGN, no Multipath) 

[0090] The modified Fast Walsh/Hadamard Transform implements a complex 

trellis decoder for which maximum scores may be compared at the trellis output. 
Thus there are 64 distinct outputs of the trellis which are preferred based on 
Euclidean distance calculations along the trellis trajectories. Only certain specific 
trajectories are considered up through the second tier of the trellis. The distance 
properties for the decoding trellis are also Euclidean for the in-phase and 
quadrature phase correlators. However, it is important to realize that the total 
distance should be calculated within the complex plane rather than simply on I 
independently of Q. That is, scoring is based on Eq. 10. 

Distance ^ V'score'+Qscore 2 (Eq. 10) 

[0091] This comes from the fact that there are pairs of I and Q chip code words 

which are dependent. That is the nature of the complex Walsh-Hadamard codes. 
Fortunately, a sufficient statistic exists which does not require the square root 
operation. Simply calculating the sum of the squares or estimating the vector 
magnitude will suffice. In this manner then the total distance or weighting 
through complex space is calculated. The largest output out of the 64 complex 
operations (weighting scenario) then becomes the most likely 8 chip complex 
code. 
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A. Magnitude Comparator 

[0092] In order to determine which code symbol was most likely encoded, a 

magnitude compare operation is performed on the outputs from the summer 102 
(FIG. 1). A variety of types of magnitude compare operations can be performed. 

[0093] FIG. 6 illustrates an example parallel magnitude compare operation in 

accordance with an aspect of the invention. In operation, the I and Q inputs, 8 
bits wide each, for example, are squared and summed at the correlator outputs to 
form 64 scores. These 64 scores are compared and the largest result selected as 
the maximum likelihood symbol estimate. Each of the 64 outputs are assigned 
their corresponding chip code-to-6-bit data map. An additional di-bit is decoded 
from the differential phase decoder. In an embodiment, the squaring operation 
results in 16 bit output value when the inputs from each I and Q correlator are 
truncated to an extent . reasonable to minimize the compare tree. In an 
embodiment, a parallel compare tree utilizes log2(64)-l compares to obtain the 
most likely result. 

[0094] In an embodiment, the magnitude compare operation illustrated in FIG. 6 

utilizes a flag at each level of compare to indicate the winning local score at that 
level. The winning local score can be traced from the output back to one of the 64 
original input correlation scores to decide which 6-bit word is most likely. In an 
embodiment, outcomes of tie scores at one or more levels are arbitrarily 
determined. In an embodiment, magnitude compare operations are performed 
with an adder/subtractor to create the result C=A-B, where A and B are inputs. 

[0095] Another magnitude compare technique that can be utilized is referred to 

herein as space slicing, which includes the steps of examining the MSB of the 
correlator outputs, and throwing away results not active in the MSB. If none are 
active in the MSB then the next MSB is compared, so on and so forth. Any 
surviving correlator outputs are compared in the next most significant MSB in 
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succession until all necessary compares are exhausted. This technique is useful 
because it requires only 1-bit compares at each level down to the final compare. 
In an embodiment, 1 bit compares are performed with an exclusive OR gate. 
Generally, there is no deterministic way to predict the number of surviving 
compares which may be passed on to the next level, but the maximum number 
typically reduces by a factor of 2 at each level. This approach relies on a 
statistical distribution of scores, which may permit rapid sifting. If all of the 
distances are similar in magnitude then sifting typically requires more operations. 
For instance, if all 64 distance calculations/scores possess an active MSB then the 
first round of sifting will not eliminate any scores and all scores are then be 
compared in the next MSB. Although this is not likely to occur, it should be 
anticipated for associated hardware realization. 

VII. Example Methods for Implementing the FCT Algorithm 

[0096] FIG. 8 illustrates an example process flowchart 800 for implementing the 

FCT algorithm in accordance with an aspect of the present invention. For 
illustrative purposes, the flowchart 800 is described herein with reference to one 
or more of the drawing figures described above. The invention is not, however, 
limited to the examples illustrated in the drawings. Based on the description 
herein, one skilled in the relevant art(s) will understand that the invention can be 
implemented in a variety of ways. 

[0097] The example process flowchart 800 illustrates a method for correlating an 

encoded data word (Xo-X M -i) with encoding coefficients (Co-Cm-i), wherein each 
of (Xo-Xm-i) is represented by one or more bits and each said coefficient is 
represented by one or more bits, wherein each coefficient has k possible states, 
wherein M is greater than 1 . 
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[0098] The process begins with step 802, which includes multiplying X 0 with 

each state (C 0 (o) through C 0 (k-i)) of the coefficient Co, thereby generating results 
X 0 Co(o) through XoCo(k-i). This is illustrated, for example, in FIGS. 3 A, 3B, and 
3C just prior to the kernels 302A, B, C, and D. 

[0099] Step 804 includes repeating step 802 for data bits (Xi-X M -i) and 

corresponding coefficients (C)-Cm-i), respectively. This is illustrated, for 
example, in FIGS. 3B, and 3C just prior to the kernels 302E through 302Q. 

[00100] Step 806 includes grouping the results of steps 802 and 804 into N groups 
and summing combinations within each of said N groups, thereby generating a 
first layer of correlation results. This is illustrated, for example, in FIGS. 3A, 3B, 
and 3C by the kernels 302, and the resultant first layer of results 307. 

[00101] Step 808 includes grouping the results of step 806 and summing 
combinations of results within each group to generate one or more additional 
layers of results, and repeating this process until a final layer of results includes a 
separate correlation output for each possible state of the complete set of 
coefficients (C 0 -C M -i). This is illustrated in FIG. 3C and FIG. 7, where the 
summers 306 generate a second layer 310, the FCT final output trellis 702 (FIG. 
7) provides separate outputs for each possible state of the complete set of 
coefficients (Co-Cm-i) in a final layer 704. 

[00102] In an embodiment, steps. 806 and 808 include the step of omitting 
summations that would result in invalid combinations of the encoding coefficients 
(Co-Cm-i). This is illustrated in steps 806A and 808A. This also is illustrated, for 
example, in FIG. 7, wherein the second level of results 310 omits the following 
combinations: 
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C 4 CsC6(-C 7 ); 

C 4 C 5 (-C 6 )(-C 7 ); 

(-C 4 )C 5 C 6 (-C 7 ); 

(-C 4 )C 5 (-C 6 )(-C 7 ); 

C 4 (-C 5 )C 6 (-C 7 ); 

C 4 (-C 5 )(-C 6 )(-C 7 ); 

(-C 4 )(-C 5 )C 6 (-C 7 ); and 

(-C 4 )(-C 5 )(-C 6 )(-C 7 ). 

[00103] In this example, the omissions eliminate performing summations for 
combinations that are invalid in light of the CCK code or that result in null 
operation. In other embodiments, different combinations may or may not be 
omitted based on particular codes. 

[0100] Step 810 includes comparing magnitudes of said separate correlation 

outputs, thereby identifying a most likely code encoded on said data word. This is 
illustrated, for example, in FIG. 6, by the example parallel magnitude compare 
operation. 

[0101] In an embodiment, the process flowchart 800 further includes the step of 

performing steps (1) through (5) using substantially the same hardware for in- 
phase and quadrature phase components of the data word (Xo-Xm-i). 

[0102] In an embodiment, the coefficients (Co-Cm-i) represent real numbers. In 

an alternative embodiment, the coefficients (Co-C M -i) represent complex numbers. 

[0103] In an embodiment, the coefficients (Co-C M -i) are represented with a single 

bit. Alternatively, the coefficients (Co-Cm-i) are represented with multiple bits 
(e.g., magnitude). 

[0104] In an embodiment, the coefficients (Co-C M -i) represent a cyclic code 

keying ("CCK") code set substantially in accordance with IEEE 802.11 WLAN 

standard, illustrated in the tables below. 
[0105] In an embodiment, as illustrated in one or more of the prior drawing 

figures, M equals 8, N equal 4, and the coefficients (Co-Cm-i) have two states, 

plus and minus. 
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VIII. CCK Chip Code Words 

[0106] Tables are provided below that illustrate source-input data symbols, 8-bits 

long (d 0 -d 7 ), and corresponding in-phase and quadrature phase 8 chip symbols 

used for CCK. The complex chip notation is provided for reference. In addition 
the algorithm flow diagram 4-tuple sums are provided since the last level of flow 
diagram becomes complex and difficult to follow. B 0 ...B 31 are the 4-tuple 

intermediate correlation results relating to the signal flow diagrams presented for 
the correlator algorithm. Two branch 4-tuples form the final output result for 
each correlator. B 0 . . .B 15 provide options for the first branch component to form 

a final output correlator 8-tuple while B 16 ...B 31 provide the second branch 

component or second 4-tuple. For instance, Table 1 illustrates an example build- 
up: 



Table 1 



4-tuple Designator 


4-tuple Coefficient Sequence 


B 6 


-1,1,1,-1 


B28 


1,1,-1,-1 


4-tuple Combination 
B 6 +B 28 => 


Final 8-tuple Correlator Coefficient Sequence 
-1,1,1-1,1,1-1-1 
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[0107] Logical zeros become weighted by an arithmetic value, -1. In this 

manner the optimum correlator trajectory for a particular chip sequence is 
projected through the correlator trellis. The example above corresponds to the in- 
phase correlator waiting for an originally transmitted data sequence 
d 0 -d 7 of 0,0,1,0,1,0,1,0 . For this example, that particular branch represents a 

correlation provided in Eq. 1 1 ; 

Y42 =X 0 (-1) + X 1 (1) + X 2 (1) + X 3 (-1) + X 4 (1) + X 5 (1) + X 6 (-1) + X 7 (-1) (Eq. 11) 

[0108] x 0 ...x 7 represent corrupted or noisy input signal samples. When the x l 

match the coefficients significantly then that 8-tuple (1 of 64) output possesses 
maximum weighting and is declared most, likely by the magnitude comparator. 
.Another strategy seeks to minimize the distance between the X, and c, . The 

process is similar in either case. 
[0109] Table 2 illustrates example in-phase and quadrature 4-tuple combinations. 

It is noted that the examples provided in Tables 1 and 2 are provided for 
illustrative purposes and are not limiting. Other specific examples will be 
apparent to persons skilled in the relevant arts based on the teachings herein, and 
such other examples are within the scope and spirit of the invention. 



Table 2 



dOdl d2 d3 
d4d5 d6d7 


In phase 


' 4 i tuple 
Combination 


Quadrature 


4-tuple 
Combination 


Complex 


Do 


00000000 


11101101 


B2 + B20 


11101101 


B2 + B20 


111-111-11 


D, 


00000001 


00011101 


B !3 + B 2 0 


11101101 


B2 + B 2 o 


jjj-jll-11 


D 2 


00000010 


00011101 


B 13 + B 2 o 


00011101 


B13 + B 2 o 


-1-1-1111-11 


D 3 


00000011 


11101101 


B 2 + B20 


00011101 


B13 + B 2 o 


-j-j-jjll-11 


D 4 


00000100 


00100001 


Bu + B23 


11101101 


B 2 + B20 


jjl-ljj-11 


D 5 


00000101 


00010001 


B 13 + B^ 


00101101 


B| 4 + B 2 Q 


-1-lj-jjj-ll 



-25- 



dOdl d2d3 
d4 d5 d6 d7 


In phase 


4-tuple 
Combination 


Quadrature 


4-tuple 
Combination 


Complex 


D 6 


OOOOOllO 


11010001 


B, + B 23 


00011101 


B13 + B 2 q 


-j-j-lljj-11 


D 7 


00000111 


11100001 


B 2 + B 23 


11011101 


B, + B 20 




D 8 


00001000 


00100001 


Bj4 "t" B 2 3 


00100001 


Bj4 + B 23 


-1-11- 1-1- 1-11 


D 9 


00001001 


11010001 


Bj ~f~ B2 3 


00100001 


B|4 B 23 


-j-jj-j-1-1-11 


D, 0 


00001010 


11010001 


B, + B 23 


11010001 


B, + B 23 


11-11-1-1-11 


D„ 


00001011 


00100001 


B|4 4* B23 


11010001 


B, + B23 


jj-jj-1-1-11 


Di 2 


00001100 


11101101 


B2 4* B20 


00100001 


B14 + B23 


-j-jl-l-j-j-11 


D 13 


00001101 


11011101 


Bj + B20 


11100001 


B 2 + B 23 


llj-j-j-j-11 


D,4 


00001110 


00011101 


B13 4* B 2 q 


11010001 


B1 + B23 


jj-ll-j-j-11 


D 15 


00001 1 1 1 


00101101 


Bj4 B20 


00010001 


B|3 B23 


-1-1-jj-j-j-l 1 


D 16 


00010000 


01000111 


B 7 + B 17 


11101101 


B 2 + B 20 


jlj-ljl-jl 


Dn 


00010001 


00010111 


B|3 Bj7 


01001101 


B7 "t" B20 


-lj-1-jjl-jl 


P,g 


00010010 


10110111 


Bg "I - Bj7 


00011101 


B13 4" B20 


-j-l-jljl-jl 


D 19 


00010011 


11100111 


B2 4" B|7 


10111101 


B 8 B20 


1-jljjl-jl 


D20 


00010100 


00000011 


B|5 B|9 


01100101 


B 6 + B 2 j 


-ljj-l-lj-jl 


D21 


00010101 


10010011 


B9 + Bj9 


00000101 


B15 4" B21 


-j-l-l-j-lj-jl 


D22 


00010110 


11110011 


Bq + Bj9 


10010101 , 


B9 + B21 


1-j-jl-lj-jl 




00010111 


01100011 


B§ + B19 


11110101 


Bo + B21 


jllj-lj-jl 


D 24 


00011000 


10001011 


Bn .+ B 18 


00100001 


Bj4 + B23 


-j-lj-l-j-l-jl 


D 25 


00011001 


11011011 


Bi + B| 8 


10000001 


Bu + B23 


1-j-l-j-j-l-jl 


D 26 


00011010 


01111011 


B 4 + B 18 


11010001 


B,+B23 


jl-jl-j-l-jl 


D 27 


00011011 


00101011 


Bi4 + B| 8 


01110001 


\ 

B4+ B 23 


-ljlj-j-l-jl 


D 28 


00011100 


11001111 


B 3 + B I6 


10101001 


Bio B22 


1-jj-ll-j-jl 


D29 


00011101 


01011111 


B 5 + B I6 


11001001 


B 3 + B M 


jl-l-jl-j-jl 
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dO dl 62 d3 
d4 d5 66 61 


In phase 


4-tuple 
Combination 


Quadrature 


4-tuple 
Combination 


Complex 


D 30 


OOOllllO 


00111111 


B|2 + Big 


01011001 


B 5 + B 22 


-lj-j 1 1 -j-j 1 


D 3I 


0001 1111 


10101111 


Bjo + Bjg 


00111001 


Bj2 B22 


-j-lljl-j-jl 


D 32 


00100000 


01000111 


B 7 + B, 7 


010001 11 


B 7 + B, 7 


.-11-1-1-1111 


D33 


00100001 


10110111 


Bg + Bj7 


01000111 


B 7 + B 17 


-jj-j-j-1111 


D34 


00100010 


10110111 


Bg + B17 


10110111 


Bg Bj7 


1-111-1111 


D 35 


00100011 


01000111 


B 7 + B l7 


10110111 


Bg *t" Bj7 


j-jjj- 1111 


D 36 


00100100 


10001011 


B| 1 + Bjg 


01000111 


B7 "4" Bj7 


-jj-i-i-jjn 


D 37 


00100101 


10111011 


Bg + B]g 


10000111 


Bjj "4" Bj7 


1-1-j-j-jjii 


D 38 


00100110 


01111011 


B4 + Bjg 


10110111 


Bg *t* Bj7 


j-jii-jjii 


D 39 


00100111 


01001011 


B 7 + Bjg 


01110111 


B4 + B.7 


-iijj-jjn 


D 40 


00101000 


10001011 


B]j + Bjg 


10001011 


B 1 1 "4* Bjg 


1-1-1-11-111 


D 4 » 


00101001 


01111011 


B4 "4* Bjg 


10001011 


D 111 fcj 1 0 


j-j-j-j 1-111 


D 42 


00101010 


01111011 


B4 Bjg 


01111011 


B4 Bjg 


-11111-111 


D 43 


00101011 


10001011 


Bj j + Bjg 


01111011 


B4 "J - Bjg 


-jjjji-ni 


D44 


00101100 


01000111 


B7 + B17 


10001011 




j-j-i-ij-jii 


D45 


00101101 


01110111 


B 4 + B 17 


01001011 


B7 "4* Bjg 


-11-j-jj-jii 


D46 


00101110 


10110111 


Bg + Bj7 


01111011 


■ B4 Bjg 


-jj 11 j-j 1 1 


D47 


00101111 


10000111 

* 


Bu + B17 


10111011 


Bg + Bjg 


i-ijjj-jii 


D 48 


00110000 


11101101 


B2 + B20 


01000111 


B 7 + B 17 


-ji-j-i-jiji 


D49 


00110001 


10111101 


Bg + B20 


11100111 


B 2 + B 17 


iji-j-j-ji 


D50 


00110010 


00011101 


B3 + B20 


10110111 


B 8 + Bp 


j-iji-jiji 


D 5 , 


00110011 


01001101 


B 7 + B 2 o 


00010111 


B 3 + B 17 


-i-j-ij-jiji 


D52 


00110100 


10101001 


Bio + B22 


11001111 


B 3 + B l6 


lj-j-lljjl 


D53 


00110101 


00111001 


Bj2 + B22 


10101111 


Bio + B|6 


j-11-jijji 
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d0dld2d3 
d4 d5 d6 d7 


In phase 


4-tuple 
Combination 


Quadrature 


4-tuple 
Combination 


Complex 


D54 


00110110 


01011001 


B 5 + B 22 


00111111 


B12 + Bi 6 


-1-jjlljjl 


D 55 


00110111 


11001001 


B 3 + B 22 


01011111 


B 5 + B I6 


-jl-ljljjl 


D 56 


00111000 


00100001 


Bu + B 23 


10001011 


B,i + B t8 


j-l-j-lj-ljl 


D57 


00111001 


01110001 


B 4 + B 23 


00101011 


B l4 +Bi8 


-1-jl-jj-ljl 


D 58 


00111010 


11010001 


B, + B 23 


01111011 


B4 + Big 


-jljlj-ljl 


D 59 


001U011 


10000001 


Bu + B 23 


11011011 


B, + B, 8 


lj-ljj-ljl 


Deo 


00111100 


01100101 


B 6 + B 2l 


00000011 


B, 5 + Bj 9 


-1-j-j-l-l-jjl 


D 6I 


00111101 


11110101 


Bo + B 2! 


01100011 


B6 + B19 


-jll-j-l-jjl 


D 62 


00111110 


10010101 


B 9 + B 2 i 


11110011 


Bq + B19 


ljjl-1-jjl 


D 63 


00111111 


00000101 


B15 + B 21 


10010011 


B9 + B19 


j-l-lj-l-jjl 
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IX CCK Decoder 

t 

A. Introduction 

[0110] A fast correlator transform (FCT) kernel in accordance with the invention, 

can be used as a building block for a CCK decoder. For example, a FCT kernal 
can be applied and used to decode data using the -IEEE 802.11b 11 Mbps 
signaling scheme. For the design, two correlators are preferably used. Since the 
signaling scheme is complex one correlator is used for the in-phase (I) channel, 
and another correlator is used for the quadrature phase (Q) channel. The input to 
each channel is the designated codeword for the transmitted data. Table 3 
illustrates example input data and corresponding coded output. 



Data (input) 


Inphase (output) 


Quadrature 
(output) 


00000001 


00011101 (B13+B20) 


11101101 
(B2+B20) 



Table 3: Input Data and CCK Coded Output 



[0111] The CCK coded data for each channel is passed into its respective 

correlator for processing. The output of a branch correlator trellis preferably has a 
maximum value of 8 (with no noise in the system) that corresponds to a 
correlation with the input codeword. The value can be a +8 depending on the 
phase. The value is then magnitude squared, which will give one branch of the 
trellis a maximum value of 64. The in-phase and quadrature branches of the I and 
Q correlators are then paired together according to the CCK codeword in Table 2, 
above. These pairs, when combined, provide a maximum I 2 +Q 2 value of 128. 
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[0112] The CCK decoder works fine when there is a 0 or n phase rotation. A 

problem arises, however, when there is a n/2 or a 3n/2 phase rotation. The 
problem is described with the following example. 

[0113] The base codeword Table 2, above, for the 11Mbps CCK encoding 

scheme gives the pairs of codewords that correspond to a given input message. 
For 56 out of the possible 64 codewords, the encoded words on the I and Q 
channel are similar, except that they are on opposite channels. An example of two 
codewords are shown in Table 4 below. 



Data (input) 


Inphase (output) 


Quadrature (output) 


00001100 


11101101 (B2+B20) 


00100001 (B14+B23) 


00000100 


00100001 (B14+B23) 


11101101 (B2+B20) 



Table 4: CCK Input and Encoded Data 



[0114] The codewords for the two data symbols given in Table 4 are similar, but 

are on different channels. So if there is a n/2 or a 3n/2 phase shift from 
differentially encoding the transmitted data, then the I and Q channels will be 
swapped. This is illustrated in FIG. 9, which is a signal path diagram for an 
example CCK decoder output trellis, including an I signal path 902 and a Q signal 
path 904. 

[0115] If the codewords received on the I signal path 902 and the Q signal path 

904 are 11101101 and 00100001, respectively, then the maximum output will be 
8 and will follow paths 906a and 906b, respectively. But if the codewords 
received on the I signal path 902 and the Q signal path 904 are swapped, then the 
maximum output will be 8 and will follow paths 908a and 908b, respectively, 
which correspond to different input data than paths 906a and 906b. This is 
acceptable if that is the intended path. But if there is a n/2 or a 3n/2 phase 
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rotation, then the input words to the correlator are swapped on the I and Q 
channels, and the I or Q channel is inverted depending on what the phase rotation 
was. For these cases, if the input codewords before differential encoding 
correspond to the paths 906a and 906b, then the decoded data should be that 
which is associated with the paths 906a and 906b, not the paths 908a and 908b. 
Since the differential decoding process is typically performed after or during the 
correlation process, then the phase rotated symbols enter the correlator in that 
form. A determination should be made as to whether there was a phase rotation. 
Phase rotation can be discerned as follows. If there was a nil phase shift, the in- 
phase channel will be inverted. If there is a 37t/2 phase shift, the quadrature phase 
channel will be inverted. 
[0116] FIG. 10 is a block diagram of an inverted channel detection circuit 1000 

that can be used to determine if one of the channels was inverted. If both 
channels were inverted then there was a n phase shift and the data is decoded 
correctly, so this is of no consequence to the system. 

X. CCK Correlator Optimizations 

A. Introduction 

[0117] A CCK parallel correlator in accordance with the invention can be 

optimized to reduce the number of adders in a Fast Correlator Transform (FCT) 
kernel. An initial complex correlation is presented below, followed by various 
optional optimizations that reduce the complexity of the initial complex 
correlation. 

! - 

s 

[0118] Table 6 lists 64 complex codewords that have coefficients with four 

possible values of +1, -1, +j, and -j. This means that either the real part or 
imaginary part of each coefficient is 0, but not both. Table 6also lists 
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corresponding codewords rotated by 45° to get the I,Q representation. Note that 

each coefficient in the I,Q representation has a magnitude of 4l since the four 
possible coefficients are now +l+j, +l-j, -1+j, and -1-j. Rotation of the 
coefficients is the key to simplification of the FCT. 
[0119] Additional simplification is achieved by realizing that some intermediate 

results are the negative of other intermediate results at the same stage. 

B. Initial Complex Correlation 

[0120] An initial complex correlation of complex CCK codewords with complex 

inputs can be achieved from equation 12: 



y(n) = £c(k)x(n-k) (Eq. 12) 

where c(k) = c 7 (k) + jc Q (k) and x(k) = x { (k) + jx Q (k). 
[0121] Expanding out the complex product results in equation 13: 



7 



JW) *=0 



7 



>0 fcO 



(Eq. 13) 



[0122] FIG. 11 is a block diagram 1100 of the complex correlation of equation 

12. 

[0123] The resulting complex correlation of equation 12 includes four real 

correlates performed in parallel. Implementing the correlate equations of 
equation 12 results in 4x7+2=30 additions per codeword. For the 64 CCK 
codewords there are 64x30=1920 additions to implement all the required 
correlations. Since the real or imaginary part of each CCK complex coefficient is 
zero, the computation is reduced to 1 920-(64x2x7)=l 024. 
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C. Fast Correlator Transform (FCT) 

[0124] A Fast Correlator Transform (FCT) can be used to reduce the computation 

by assuming the use of 45° rotated I,Q codewords shown in Table 6, below. To 
derive the 64 parallel correlates for all of the codewords, two substantially similar 
FCTs are preferred. One is for the real part of the input. The other is for the 
imaginary part of the input. FIG. 12 is a block diagram of an example FCT 1200. 
For a CCK with codeword size of 64 the FCT 1200 can be divided into several 
stages. For example, FIG. 13 illustrates a FCT 1300 that includes stages la 
(1302), lb (1304), 2a (1306), 2b (1308), and 3 (1310), which are described below. 

[0125] FIG. 14 is an example block diagram of stage la of the FCT 1300. In the 

example of FIG. 14, the stage la includes 2 additions and 6 subtractions, which 
implement the following: 

D 0 =x 0 +x l9 D x =-x 0 +x, , D 2 =x Q -x 15 Z) 3 =-x 0 -x x 

D 4 =x 2 +x 3> D 5 = -x 2 +jc 3 , D 6 =x 2 -x 3 , D 7 =-x 2 -x 3 . 
[0126] FIG. 15 is an example block diagram of stage lb of the FCT 1300. In the 

example of FIG. 15, the stage lb includes 8 additions and subtractions, which 
implement the following: 

D B =x 4 +x 5 , D 9 = -x 4 +x 5 , D x0 =x 4 -* 5 , D u =-x 4 -x 5 

D X2 =x 6 +x 79 D u =-x 6 +x 7 , D u =x 6 -x 7 , D X5 =-x 6 -jc v ' 
[0127] FIG. 16 is an example block diagram of stage 2a of the FCT 1300. In the 

example of FIG. 16, the stage 2a includes 16 additions, which implement the 
following: 

B 0 =Z> 0 +Z) 4 , B x =jD 0 +Z) 5 , B 2 =D 0 +D 6 , B 3 =D 0 +Z) 7 
B 4 =D, +i) 4 , B 5 =D X +D S , B 6 =D X +Z) 6 , B 7 =D X +Z) 7 
B S =D 2 +D 4 ,B 9 =D 2 +D 5 ,B X0 =D 2 +D 69 B U =D 2 +D 7 
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B n = D 3 +D 4 , 5 U = D 3 +D S> B lA = D 3 +D 6 , 5 15 = £> 3 +£> 7 . 
[0128] FIG. 17 is an example block diagram of stage 2b of the FCT 1300. In the 

example of FIG. 17, the stage 2b includes 16 additions, which implement the 
following: 

5 16 = D s + D n , B l7 = D 9 + D X2 , B n = D w + D i2 ,B i9 =D u + D n 

B 20 = D s + £> 13 , 5 2 , = D 9 + D l3 , B 22 = D l0 + D u ,B 23 =D U + D x3 

B 24 = D t + D X4 , B 25 = D 9 + D X4 , B 26 = D xo + D u , B 21 = D XX + D X4 

B 2t =D t +D i5 , B 29 =D 9 + D X5 , B 30 =D X0 + D X5 , B 3X = D XX + D X5 
[0129] Stage 3 of the FCT 1300 includes 64 additions for the I values and 64 

additions for the Q values, as follows: 

I 0 =B 2 +B 20 , I x =B l3 +B 20 , I 2 = B X3 +B 20 , I 3 =B 2 +B 20 
Q 0 =B 2 + B 2Q , Q x =B 2 + B 20 , Q 2 = B X3 + B 20 , Q 3 = B i3 + B 20 
h =B i4 +B 23 , I 5 =5 n +B 23 , I 6 = B X + B 23 , I y =B 2 +B 23 
Q 4 =B 2 +B 20 , Q 5 = B X4 +B 2Q , Q 6 =B l3 +B 20 , Q 2 = B x +B 20 

h ~ B 14 + B 23 > h = B \ + B 23 ' ^10 = B \ + B 23 > hi = B 14 + B 23 
Qi ~ B \4 + B 23 > 09 = B \4 + B 23 > QlO = B \ + B 23 » 2ll = B \ + B 23 

I X2 =B 2 +B 20 , 7,3 = B X +B 20) I X4 =5,3 +B 20 , I xs =5, 4 +5 20 

Q\2 =B 14 + 5 23>£?13 = 5 2 + 5 23>£?14 = B \ + B 23' QlS ~ B \3 + B 23 

he =B i +B v> hi ~ B \3 + B n> hs = B % + B \i> h<> ~ B 2 + B \i 
Q x6 =B 2 +B 20 ,Q ll =B 7 +B 20 , Q Xi =B x3 +B 20 , Q X9 =5, +B 20 

ho ~ B \S + ^19> h\ ~ B 9 + -#19> h2 ~ B 0 +B )9> h3 ~ B 6 +B 19 
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^20 = B 6 +^21> Ql\ = B \S + B 2\^22 = B 9 + B 2\ > 023 = 5 0 + B 2\ 
J 24 =5 11 + ^18> J 2S =B \ + 5 18> ^26 = 5 4 + 5 18> ^27 = 5 14 + ^18 
£?24 = S \4 + S 23>Q2S + B 21>Q26 = 5 I + 5 23 > ^27 = 5 4 + 5 23 

^28 = ^3 + ^16> ^29 =^5 + 5 16> Ao =5 12 + 5 16> Al = B \0 + B \6 
(2 2 8 ==B \0 + ^22'!229 = ^3 + 5 22>£?30 =^5 + ^22>£?31 = 5 12 + ^22 

/ 3 2 =5 7 +2? l7 , / 33 = 2? B -+2?, 7 , 7 34 = 5 8 +5 17 , I 3S =B 7 + 5 I7 

032 = 5 7 + #17 » ^33 = 5 7 + ^17 > £?34 =■ ^8 + B \7 > 035 = ^8 + ^17 
^36 = 5 11 + 5 18 > ^37 = ^8 + B \l 9 As = B 4 + ^18 ^39 = ^7 + 5 18 
036 = 5 7 + B \l>Qn = ^11 + ^17 > 038 = 5 8 + ^17 > 039 = 5 4 + B \l 
^40 = #11 + #18 > ^41 = #4 + #18 > A2 = B 4 + #18 > ^43 = #11 + #18 
040 = #il + #i8>041 = B U + #18>042 = #4 + #18> 043 = #4 + #18 
I44 = #7 + #17 > ^45 = #4 + ^17 > ^46 = B \\ + #17 > ^47 = B \\ + #17 
044 = #11 + #18>045 =B 7 + #i8> Q46 = B 4 + #18> 047 = #8 + #18 
^48 = #2 + #20 > ^49 = B Z+ B 20 > ^50 = #13 + B 20 > ^51 = #7 + B 20 
048 = #7 + #17 > 049 = B 2+ B \l > QsO = ^8 + 5 17 » ftl = ^13 + 5 17 
^52 =5 10 +jff 22> ^53 ~ B \2 + 5 22 » I S4 =B 5 + B 22 > hs = B 3 + B 22 
Qs2 = ^3 + ^16 » 2s3 = ^10 + B \6 ' 054 = B \2 + ^16 > ^55 = ^5 + ^16 
7 56 =5 14 +5 23» A7 =^4 +5 23» ^58 = B \ + B 22> 1 59 = B \\ + B 23 
Qs6 = B U + 5 18>£?57 = B \4 + B \Z> Qs* =B 4+ B \%> Qs9 = B \ + ^18 
^60 = B 6 + B 2\y h\ = 5 0 + 5 2l » A2 = 5 9 + ^21 » ^63 = 5 15 + 5 21 
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Qm = B \S + B \9>Qt\ = B 6 + 5 I9> £?62 = B 0 + 5 19> £?63 = B 9 + B \9 

[0130] Stage 3 can be optimized by noting that there are 40 distinct values for I 

and Q which are 

4) = B 2 + B 2Q > A ~ B \3 + ^20 ? A 2 = B \4 + ^23 > A = ^13 + ^23 

A 4 = B u + B 20 , A 5 = B x + 5 23 , 4 = 5 2 + 5 23 , 4 = 5, + £ 20 
A % = 2? 7 + £ l7 , 4, = 5 13 + B X1 , yi 10 = 5 7 + B 20 , = Z? 8 4- i? 17 

42 ~ B 2 + ^17 > ^13 = B Z + 5 20 > A* = B \5 + 5 I9 > 4s 55 5 6 + B 2\ 
A 16 = B 9+ B \9 > A \7 = B \S + 5 21 > 4s = ^0 + B \9 > A \9 = ^9 + 5 21 
^20 = + ^19 > ^21 f ^0 + ^21 » 4>2 = + ^18 > ^23 = ^1 + B \% 
A 24 = ^11 + 5 23 > ^25 " 5 4 + 5 18 > ^26 = 5 14 + ^18 > ^27 = B 4 + ^23 
^28 = ^3 + ^16 > ^29 = ^10 + ^22 ^30 = ^5 + B \6 > ^31 = ^3 + ^22 
^32 = 5 12 + 5 16>^33 =5 5 + #22 > ^34 = B \0 + B \6> As = 5 12 + ^22 x 
4*6 = ^8 + ^18 » ^37 = ^11 + B 17 > ^38 = B l + ^18 > ^39 = ^4 + ^17 ' 

[0131] FIG. 18 is a block diagram of an example system 1800 that calculates the 

40 A n values. The 40 A n values are used to calculate the final 64 output values 
and are described below. 

[0132] Further optimization of the FCT 1300 can be achieved by considering the 

relationship: 

*„=-*,5-„ ,8</i<15 (Eq.14) 
. [0133] The relationship of equation 14 eliminates the need to calculate the eight 

values of B n for 8<h<15, and the intermediate results D 2 and Z) 3 . Thus, 

8+2=10 additions can be removed and be replaced by 10 negates of the samples, 
or subtraction instead of addition at the next processing stage. 
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[0134] Because c 7 is always 1, calculation of B 2A ~-B 2l can eliminate the 

following eight additions: 

B 2A = D s + D XA , Z? 25 = D 9 + Z) I4 , 5 26 = D l0 + D 14 , B 21 = Z>„ + Z) 14 

As = A + A 5 > A 9 = A + As > Ao = Ao + As , Ai = A , + As 

and, consequently, the following two additions: 

A4 = **6 "~ ^7 ' A5 = ~ X 6 ~ X 7 ' 

[0135] Four more additions can be removed by considering that: 

D 6 =-D 5 , D 7 = -Z> 4 , D xo =-D 99 Ai ="A 

[0136] Table 6 shows the 64 correlate outputs as a function of B 0 - B 7 and 

B X6 "B 23 . Note that 40 distinct summations, or subtractions, are used to 

calculate the 128 results in the final stage. FIG. 19 is a block diagram of 
connection stages la, lb, 2a, 2b, and 3 for an example simplified architecture 
, FCT 1900. Note the reduction in states between stages from the FCT 1300 in FIG. 
13 and the FCT 1900 in FIG. 19. The FCT 1900 includes stages la (1902), lb 
(1904), 2a (1906), 2c (1908), and 3 (1910), which are described below. 
[0137] FIG. 20 is an example block diagram of the stage la of the FCT 1900. In 

the example of FIG. 20, stage la includes 4 additions and subtractions, which 
implement the following: 

A = X 2 + X 3 * A - ~ X 2 + X 3 

[0138] FIG. 21 is an example block diagram of the stage lb of the FCT 1900. In 

the example of FIG. 21, stage lb includes 4 additions or subtractions, which 
implement the following: 

A =*4 + *s> A =-*4 +*5 
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D X2 =x 6 +x 1 , D u =-x 6 +x 7 
[01 39] FIG. 22 is an example block diagram of stage 2a of the FCT 1 900. In the 

example of FIG. 22, stage 2a includes 8 additions or subtractions, which 
implement the following: 

B 0 =D 0 +D 4 , B x = D 0 +D 5 , B 2 =D 0 -D 5 , B 3 =D 0 -D 4 

B 4 =D X + D 4 , B 5 =£>, +D 5 , B 6 = D X -D 5 , B n = D X -D 4 

[0140] FIG. 23 is an example block diagram of stage 2b of the FCT 1900. In the 

example of FIG 23, stage 2b includes 8 additions, which implement the 
following: 

B X6 =D S +D X2 , B„=D 9 +D i2 , B xs =-D 9 +D x2 , 

B X9 =-D S +D X2 

B 20 =D S +D n , B 2X =D 9 +D n , B 22 =-D 9 +D u , 

B 23 =-D i+ D n 

[0141] FIGS. 24A and 24B illustrate an example block diagram of stage 3 of the 

FCT 1900. In the example of FIGS. 24A and 24B, stage 3 includes 40 additions 
or subtractions, which implement the following: 

A 0 = B 2 + B 20 > A - ~ B 2 + B 20 > A 2 = ~ B \ + B 23 » A ~ B 2 + B 23 

A 4 =-B x +B 20 , A 5 = B X +B 23 , A 6 = B 2 +5 23 , A i = B x + B 2b 

A = B l + B \l » 4> = ~ B 2 + B \l » A i0 = B 7 +B 20> A\ ~ ~ B 1 + B \l 

A x2 = B 2 + B xl , A Xi = -B n + B 20 , A X4 = -B 0 + B l9 , A X5 = B 6 + B 2X 
A 16 = -B 6 + B X9 , A xl = -B 0 + B 2X , A xs =B Q + B x9 , A x9 = -B 6 + B 2X 
A 2o =B 6 +B \9> A 2\ =B 0 +B 2l , A 22 =-B 4 +B xg , A 2i = B i +B XS 

A 2A ~~ B A + B 23> A 2S = B A + B IS > A 26 = ~ B l + B \t > A 21 = B 4 + B 23 
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A 2S = B 3 +B l6 , A„=-B 5 +B n A i0 =B s +B l6 , A 3l =B, +B 



22 

^32 = + ^16 > 4* = ^5 + S 22 » A 34 ~ + ^16 > 

4 5 =-* 3 +2* 22 

A 26 = + ^18 > ^37 = + S \l > 4*8 = ^7 + ^18 > ^39 = ^4 + ^17 

Final outputs are calculated using 128 additions or subtractions. 

■^70 = ^70 ^00 ' ^00 = ^Q0 ~~ ^70 ' ^71 = ^71 ^00 > ^0 = ^0 ^70 
^72 = ^71 + ^0 > ^£2 = ^0 ~ ^71 > ^73 ~ ^70 + Aq X , = A Q0 - i4 7 , 

J/4 = A n + >^ 0 , y^ 4 = Aq 2 — A IQ , Y J5 = i4 /3 + v4g 4 , 

= ^03 *~ ^74 

■^/6 = + a qx , y^ = — A n , y /7 = A J6 + , 

^07 = ^£6 

y /8 = A I2 + ^ 2 , y^ 8 = ^ 2 — , y /9 = y4 /5 + Aq 2 , 
y G9 = v4g 5 — A I2 

^710 = ^75 + ^£5 » ^00 = ^05 " ^75 > ^711 = ^72 + ^Q5 > 
^01 = ^02 ~^75 

y /12 = y4 /0 + Aq 2 , y ei2 = Aq 0 — ^4 /2 , y /13 = a I7 + ^4 e6 , 

^03 = ^07 ~~ ^76 

^714 = ^71 + ^05 > ^04 = ^0 » ^/15 = ^74 + ^03 > 

^05 ~ ~ ^73 

^716 = ^78 + ^Q0 9 Yq\6 = ^Q8 ~ ^70 ' ^717 = ^79 + » 

^07 = ^9 "^/10 

^718 = ^711 + ^fil > ^08 = ^01 " ^71 * ^719 = ^712 + ^03 > 

^09 = ^012 " ^713 
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^720 ~ ^714 + ^015 > ^020 ~~ ^014 ^715 » ^721 ~" ^716 + ^017 > 



^021 ~~ ^016 A m 



Y f22 - ^4 /18 + A Ql9 , y^ 22 — A QX% A n9 ,. 7 /23 — A /20 + ^4 021 , 



l 722 ^718 ' ^019 
J Q23 ^Q20 /t I2\ 



723 — ^720 ^021 



^724 ~~ ^722 ^02 > ^024 — ^022 ^72 > ^725 — ^723 + ^024 > 



1 724 — ^722 1 ™Q2 '■ 
Y()25 ~ ^023 ~ ^724 



^726 ~" ^725 + ^05 > ^026 "~ ^025 ^75 > ^727 ~~ ^726 + ^027 > 



Y -A -A 

1 Q21 ~ ^026 ^727 



^728 ^728 + ^029 > ^028 " ^028 ^729 > ^729 ~~ ^730 + ^031 > 



1 728 ~ ^728 ^ ^029 
^029 = ^030 ~ ^731 



l 729 _ ^730 r ^031' 



^730 ~~ ^732 + ^033 > ^030 — ^032 ^733 > ^731 — ^734 + ^035 ' 



Y -A - A 

2 031 ~ ^034 ^735 



Y ni — A 1% + Aq S , 
Y -A -A 

^033 ^011 ^78 



^032 " ^08 ^78 9 Y I33 — A m + A Q% , 



^035 



^711 + ^011 9 
^08 ~^711 



^034 ~ ^011 ^711 > ^735 ^78 + ^011 ' 



^736 ~ ^722 + ^08 > ^036 ~ ^022 ^78 > ^737 ~" ^736 + ^037 > 

^037 = 



V22 1 ^08 3 
^036 ^737 



^738 — ^725 + ^011 * ^038 ^025 ^711 > ^ T39 ^738 + ^039 > 



^039 ~ 



A -A 
^038 ^739 



^740 " ^722 + ^022 > ^040 ^022 ^722 > ^741 ^725 + ^022 » 



^041 — 



^025 ^722 



^742 ^725 + ^025 > ^042 "~ ^025 ^725 > ^743 — ^722 + ^025 9 



v = 

^043 



^022 ^725 
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Y/44 " + ^222 9 Yq U — Aq % A I22 , Y I45 — A n9 + A Ql% , 



y — a _ a 

J 045 ~~ ^039 

~ A in +^025> Yq*6 ~ 'Aq\\ ~ ^/25 5 ^747 T ^737 + ^£36 > 

^^47 — ^736 

/ 

^748 = ^70 + Aq % , = ^ 0 ~ ^78 ' ^749 = ^713 + ^Q\2 » 

^049 = ^013 ~ ^712 

^750 = ^71 + ^011 > ^QSQ = Aq\ ^711 ' ^751 ^ ^710 + ^09 > 

^251 = ^00 ~~ 

^752 = ^729 + ^028 » ^£?52 = ^029 ~~ ^28 > ^753 = ^735 + ^034 > 

y — A -A 

J QS3 ~ ^Q35 ^34 

^754 = ^733 + ^032 , ^54 = A Q33 — A n , Y /$5 = A m + v4g 3() , 

^255 = ^031 ~ ^30 

^756 = ^72 + ^(222 > T<256 = ^02 *~ ^22 > ^757 = ^727 + ^026 > 

^£>57 = ^{?27 ~ ^26 

^758 = ^75 + ^£25 » ^58 = ^Q5 ~ ^25 » ^759 = ^724 + 4g23 > 

^059 = ^024 ~ ^23 

^760 ~ ^715 + ^Q\4 > Y<>60 = ^Q\S ~" ^14 > ^761 == ^721 + ^020 > 

^(?61 = ^{?21 ^20 

^762 = ^719 + ^08 > ^262 = ^Q\9 ~~ ^18 » ^763 = ^717 + ^06 > 

Y(>63 = ^07 ~~ ^16 

[0143] The number of adders in the final stage can be reduced by almost half 

from 128 to 72 by sharing adders. The adders are shared between codewords that 
are complex conjugates pairs as listed in Table 5. There are a total of 36 complex 
conjugate pairs where 8 of the codewords are complex conjugates of themselves 
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(real coefficients). Consider, as an example, the pair of codeword 1 and 
codeword 3, from Table 5, with the following output equations: 

J/i = + Aqo* Yq X — A QX — A io (Eq. 15) 

Y n . = A I0 + A QX , Y Q3 = A Q0 - A n (Eq. 16) 

[0144] The four additions, or subtractions, can be reduced to two additions by 

using the architecture shown in FIG. 18. A control signal sets the branches shown 
to opposite signs and the registers are loaded on opposite edges of the clock. 
Outputs include four output values stored in the four registers. This structure is 
repeated 36 times for each of the conjugate a, b codeword pairs in Table 5. 

[0145] To summarize, the total number of complex additions, or subtractions, for 

. the simplified parallel correlator 1900 are: 

Stage #1=8 

Stage #2 = 16 . 

Stage #3 = 40+36 = 76 



TOTAL = 100 
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Codeword 
Index a 


Codeword 
Index b 


Codeword Index 
a 


Codeword 
Index b 


Codeword 
Index a 


Codewor 
d Index b 


0 


— 


16 


48 


32 


— 


1 


3 


17 


51 


33 


35 


2 


— 


18 


50 


34 


— 


4 


12 


19 


49 


36 


44 


5 


15 


20 


60 


37 


47 


6 


14 


, 21 


63 


38 


46 


7 


13 


22 


62 


39 


45 


8 




. 23 


61 


40 


— 


9 


11 


24 


56 


41 


43 


10 


— 


25 


59 


42 








26 


58 










27 


57 










28 


52 










29 


55 










30 


54 










31 


53 







Table 5, Codeword Conjugate Pairs 
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D. FCT Based on CCK Code Properties 

[0146] Another less complex embodiment can be derived by considering the 

equations for generating the CCK correlation output: 

d = c 1 x 1 + c 6 x 6 + c s x 5 + c 4 x 4 + £3*3 + c 2 x 2 + c x x x + c 0 x 0 (Eq. 16) 
[0147] where c 79 c 69 c S9 c 49 c^ 9 c 29 c X9 c 0 are the complex coefficients and 

x 7 9 x 6 , x 5 , x 4 , x 3 9 x 2 , jc, , x 0 are the complex input buffer samples. The codewords 
can be uniquely defined by the three coefficients, c 6 ,c 5 ,c 3 , with the other 
coefficients defined by 



C 0 


— C 6 C 5 C 3 


c, 


= c 5 c 3 


c 2 


= c 6 c 3 




= ~c 6 c 5 




= 1 



[0148] Thus, after substituting coefficient relationships and rearranging, the 

correlation output becomes: 

d = x 7 +c 6 x 6 +c 5 (x 5 -c 6 x 4 )+c,{x 2 +c 6 x 2 +c s [x x +c 6 x 0 J(Eq. 17) 
[0149] For correlation with a single codeword, the resulting structure for equation 

170 is shown in FIG. 25. 
[0150] The FCT is derived from the single correlate by converting each 

coefficient branch, starting at the input, to a positive and negative branch. The 

resulting structure is shown in FIGS. 26 and 27^ 
[0151] Stage 1 consists of 4 additions or subtractions. 

D 0 =x 0 D x =-x 0 +*,, D 2 =x 29 D 3 =x 3 
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D 4 =x 4 +x s , D s = -x 4 + x 5 , D 6 = x 6 , D 7 = x 7 
Stage 2 consists of 8 additions or subtractions. 

E 0 =Z) 0 + D 2 , E x = D 2 -D 0 , E 2 =£>, +D 2 , E 3 = D X -D 2 

E 4 =D 4 +D 6 , E 5 =D 6 -D 4 , E 6 = D 5 +D 6 , E 7 = D 5 -D 6 
Stage 3 consists of 16 additions or subtractions. 

B 0 = E 0 + D 3 , - B 3 = E x + D 3 , - B 2 = -£ 0 + D 3 , B x = + £> 3 

B 4 =E 2 +D 3 ,B S = E 3 +D 3 , -B 6 = -E 2 +D 3 , -B 7 = -E 3 +D 3 

£ 16 = E 4 + D 7 , B n = E S +D 7 , B 23 = -E 4 + D 7 , B 20 = -E 5 + D 7 

B X1 = E 6 + D 7 , B 2l = E 7 +D 7 , B 22 = -E 6 + D 7 , B K = -E 7 + D 7 
Stage 4 consists of 40 additions or subtractions. 

A 0 =B 2 +B 20 , A x =-B 2 +B 20 , A 2 = -B x +B 23 , A 3 =-B 2 + B 23 

A 4 =-B x +B 20 , A 5 = B X + B 23 , A 6 =B 2 + B 23 , A 7 =B X +B 20 

A S =B 7 +B l7 , A, =-B 2 + B X7 , A X0 =B 7 +B 20 , A n = -B 7 +B x7 

A n =B 2 + B xl , A x3 = -B 7 + B 20 , A l4 = -B 0 + B i9 , A X5 =B 6 + B 2l 

A X6 = -B 6 + B X9 , A x7 = -B 0 + B 2X , A l8 = B 0 + B x9 , A i9 = -B 6 + B 2X 

A 20 =B 6 +B 19 , A 2l =B 0 +B 21 , A 22 =-B 4 + B XS , A 23 = B x + B xg 

A 24 = -B 4 + B 23 , A 25 =B 4 + B xs , A 26 = -B x + B IS , A 27 = B 4 + B 23 

A 2% = B 3 + B X6 , A 29 = -B 5 +B 22 A 30 = B 5 + B X6 , A 3X =B 3 +B n 

A 32 =-B 3 +B X6 , A 33 =B 5 +B 22 , 4 4 =-5 5 +£ I6 , 

A 3S =-B 3 +B 22 

= -B, + B lg , A 37 = -B 4 + B X7 , A n =B 7 + B xs , A 39 =B 4 + B X7 
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Table 6 



Index 

0 


Bits 


Complex 
Codeword 


I,Q Codeword 


I,Q Combo 


000000 


+1+1+1-1+1+1-1+1 


+1+1+1-1+1+1-1+1 
+1+1+1-1+1+1-1+1 


B2+B20 
B2+B20 


i 


000001 


+j+j+j - j+1+1-1+1 


-l-i-i+i+i+i-i+i 
+1+1+1-1+1+1-1+1 


-B2+B20 
B2+B20 


2 


000010 


-l-i-i+i+i+i-i+i 


-l-i-i+i+i+i-i+i 
-l-i-i+i+i+i-i+i 


-B2+B20 
-B2+B20 


3 


000011 


-j-j-j+j+i+i-i+i 


+1+1+1-1+1+1-1+1 
-l-i-i+i+i+i-i+i 


B2+B20 
-B2+B2 0 


4 


000100 


+j+j+i-i+j+j -l+i 


-i-i+i-i-i-i-i+i 
+1+1+1-1+1+1-1+1 


-B1+B23 
B2+B20 


5 


000101 


-i-i+j-j+j+j-i+i 


-l-i-i+i-i-i-i+i 
-i-i+i-i+i+i-i+i 


-B2+B23 
-B1+B20 


6 


000110 


-j - j -l+i+j+j -l+i 


+1+1-1+1-1-1-1+1 
-l-i-i+i+i+i-i+i 


B1+B23 
-B2+B20 


7 


000111 


+i+i- j+j+j+j -l+i 


+1+1+1-1-1-1-1+1 
+1+1-1+1+1+1-1+1 


B2+B23 
B1+B20 


8 


001000 


-l-i+i-i-i-i-i+i 


-i-i+i-i-i-i t i+i 
-i-i+i-i-i-i-i+i 


-B1+B23 
-B1+B23 


9 


001001 


-j-j+j-j-1-1-1+1 


+1+1-1+1-1-1-1+1 
-i-i+i-i-i-i-i+i 


B1+B23 
-B1+B23 


;io 


001010 


+1+1-1+1-1-1-1+1 


+1+1-1+1-1-1-1+1 
+1+1-1+1-1-1-1+1 


B1+B23 
B1+B23 


11 


001011 


+j+j-j+j-i-i-i+i 


-i-i+i-i-i-i-i+i 
+1+1-1+1-1-1-1+1 


-B1+B23 
B1+B23 


12 


001100 


-j-j+i-i-j-j-i+i 


+1+1+1-1+1+1-1+1 
-i-i+i-i-i-i-i+i 


■. B2+B20 
-B1+B23 


13 


001101 


+i+i+j-j-j-j-i+i 


+1+1-1+1+1+1-1+1 
+1+1+1-1-1-1-1+1 


B1+B20 
B2+B23 


14 


001110 


+j+j-i+i-j-j-i+i 


-i-i-i+i+i+i-i+i 
+1+1-1+1-1-1-1+1 


-B2+B20 
. B1+B23 


15 


001111 


-l-i-j+j-j-j-i+i 


-i-i+i-i+i+i-i+i 
-l-i-i+i-i-i-i+i 


-B1+B20 
-B2+B23 



-46- 



Table 6 (continued) 



Index 


Bits 


Complex 
Codeword 


I,Q Codeword 


•I,Q Combo 


16 


010000 


+j+i+j -i+j+i- j+i 


-l+i-i-i-i+i+i+i 
+1+1+1-1+1+1-1+1 


B7+B17 
B2+B20 


17 


010001 


-i+j-i-j+j+i-j+i 


-l-i-i+i-i+i+i+i 
-l+i-i-i+i+i-i+i 


-B2+B17 
B7+B20 


18 


010010 


-j-i-j+i+j+i-j+i 


+1-1+1+1-1+1+1+1 
-l-i-i+i+i+i-i+i 


-B7+B17 
-B2+B20 


19 


010011 


+i- j+i+j +j+i- j+i 


+1+1+1-1-1+1+1+1 
+1-1+1+1+1+1-1+1 


B2+B17 
-B7+B20 


20 


010100 


-i+j+j-i-i+j-j+i 


-i-i-i-i-i-i+i+i 
-l+i+i-i-i+i-i+i 


-B0+B19 
B6+B21 


21 


010101 


-j-i-i-j-i+j-j+i 


+1-1-1+1-1-1+1+1 
-l-i-i-i-i+i-i+i 


-B6+B19 
-B0+B21 


22 


010110 


+i-j-j+i-i+j-j+i 


+1+1+1+1-1-1+1+1 
+1-1-1+1-1+1-1+1 


B0+B19 
-B6+B21 


23 


010111 


+j+i+i+j -l+j - j+i 


-l+i+i-i-i-i+i+i 
+1+1+1+1-1+1-1+1 


B6+B19 
B0+B21 


24 


011000 


-j-i+j-i-j-i-j+i 


+1-1-1-1+1-1+1+1 
-l-i+i-i-i-i-i+i 


-B4+B18 
-B1+B23 


25 


011001 


+i-j-i-j-j-i-j+i 


+1+1-1+1+1-1+1+1 
+1-1-1-1-1-1-1+1 


B1+B18 
-B4+B23 


.26 


011010 


+j+i-j+i-j-i-j+i 


-l+i+i+i+i-i+i+i 
1 +1+1-1+1-1-1-1+1 


B4+B18 
B1+B23 


27 


011011 


-i+j+i+j-j-i-j+i 


-l-i+i-i+i-i+i+i 
-l+i+i+i-i-i-i+i 


-B1+B18 
B4+B23 


28 


011100 


+i-j+j-i+i-j-j+i 


+1+1-1-1+1+1+1+1 
+1-1+1-1+1-1-1+1 


B3+B16 
-B5+B22 


29 


011101 


+j+i-i-j+i-j-j+i 


-l+i-i+i+i+i+i+i 
+1+1-1-1+1-1-1+1 


B5+B16 
B3+B22 


30 


011110 


-i+j-j+i+i-j-j+i 


-l-i+i+i+i+i+i+i 
-l+i-i+i+i-i-i+i 


-B3+B16 
B5+B22 


31 


011111 


-j-i+i+j+i-j-j+i 


+1-1+1-1+1+1+1+1 
-l-i+i+i+i-i-i+i 


-B5+B16 
-B3+B22 
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Table 6 (continued) 



Index 


Bits 


Complex 
Codeword 


I,Q Codeword 


I,Q Combo 


32 


100000 


-l+i-i-i-i+i+i+i 


-l+i-i-i-i+i+i+i 
-l+i-i-i-i+i+i+i 


B7+B17 
B7+B17 


33 


100001 


- j+j ~ j ~ j -l+i+i+i 


+1-1+1+1-1+1+1+1 
-l+i-i-i-i+i+i+i 


-B7+B17 
B7+B17 


34 


100010 


+1-1+1+1-1+1+1+1 


+1-1+1+1-1+1+1+1 
+1-1+1+1-1+1+1+1 


-B7+B17 
-B7+B17 


35 


100011 


+j -j+j+j -l+i+i+i 


-l+i-i-i-i+i+i+i 
+1-1+1+1-1+1+1+1 


B7+B17 
-B7+B17 


36 


100100 


-j+j-i-i-j+j+i+i 


+1-1--I-1+1-1+1+1 
-l+i-i-i-i+i+i+i 


-B4+B18 
B7+B17 


37 


100101 


+i-i-j-j-j+j+i+i 


+1-1+1+1+1-1+1+1 
+1-1-1-1-1+1+1+1 


-B7+B18 
-B4+B17 


38 


100110 


+j-j+i+i-j+j+i+i 


-l+i+i+i+i-i+i+i 
+1-1+1+1-1+1+1+1 


B4+B18 
-B7+B17 


39 


100111 


-l+i+j+j -j+j+i+i 


-i+i-i-i+i-i+i+i 
-l+i+i+i-i+i+i+i 


B7+B18 
B4+B17 


■ 40 


101000 


+1-1-1-1+1-1+1+1 


+1-1-1-1+1-1+1+1 
+1-1-1-1+1-1+1+1 


-B4+B18 
-B4+B18 


41 


101001 


+j-j-j-j+i-i+i+i 


-l+i+i+i+i-i+i+i 

+1-1-1-1+1-1+1+1 


B4+B18 
-B4+B18 


',42 


101010 


-l+i+i+i+i-i+i+i 


-1+1+1+1+1-1+1+1 
-1+1+1+1+1-1+1+1 


B4+B18 
B4+B18 


43 


101011 


-j+j+j+j+1-1+1+1 


+1-1-1-1+1-1+1+1 
-1+1+1+1+1-1+1+1 


-B4+B18 
B4+B18 


44 


101100 


+j - j -1-1+j - j+i+i 


-1+1-1-1-1+1+1+1 
+1-1-1-1+1-1+1+1 


B7+B17 
-B4+B18 


45 


101101 


-1+1-j-j+j-j+i+i 


-1+1+1+1-1+1+1+1 
-1+1-1-1+1-1+1+1 


B4+B17 
B7+B18 


46 


101110 


- j+j+i+i+j - j+i+i 


+1-1+1+1-1+1+1+1 
-1+1+1+1+1-1+1+1 


-B7+B17 
B4+B18 


47 


101111 


+i-i+j+j+j -j+i+i 


+1-1-1-1-1+1+1+1 
+1-1+1+1+1-1+1+1 


-B4+B17 
-B7+B18 
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Table 6 (continued) 



Index 


Bits 


Complex 
Codeword 


I,Q Codeword 


I,Q Combo 


48 


110000 


- j+i- j -l- j+i+j+i 


+1+1+1-1+1+1-1+1 

-1+1-1t1-1+1+1+1 


B2+B20 
B7+B17 


49 


110001 


+i+j+i- j - j +i+j+i 


+1-1+1+1+1+1-1+1 
+1+1+1-1-1+1+1+1 


-B7+B20 
B2+B17 


50 


110010 


+j -i+j+i-j+i+j+i 


-1-1-1+1+1+1-1+1 
+1-1+1+1-1+1+1+1 


-B2+B20 
-B7+B17 


51 


110011 


-i-j-i+j-j+i+j+i 


-1+1-1-1+1+1-1+1 
-1-1-1+1-1+1+1+1 


B7+B20 
-B2+B17 


52 


110100 


-j-i+i+j+j+i 


+1-1+1-1+1-1-1+1 
+1+1-1-1+1+1+1+1 


-B5+B22 
B3+B16 


53 


110101 


+j -1+1-j+i+j+j+i 


-1-1+1+1+1-1-1+1 
+1-1+1-1+1+1+1+1 


-B3+B22 
-B5+B16 


■54 


110110. 


-l- j+j+i+i+j+j+i 


-1+1-1+1+1-1-1+1 
-1-1+1+1+1+1+1+1 


B5+B22 
-B3+B16 


55 


110111 


- j+i-i+j+i+j+j+i 


+1+1-1-1+1-1-1+1 
-1+1-1+1+1+1+1+1 


B3+B22 
B5+B16 


56 


111000 


+j-i-j-i+j-i+j+i 


-1-1+1-1-1-1-1+1 
+1-1-1-1+1-1+1+1 


-B1+B23 
-B4+B18 


57 


111001 


-i-j+i-j+j-i+j+i 


-1+1+1+1-1-1-1+1 
-1-1+1-1+i-i+i+i 


B4+B23 
-B1+B18 


58 


111010 


-j+i+j+i+j -l+j+i 


+1+1-1+1-1-1-1+1 
-l+i+i+i+i-i+i+i 


B1+B23 
B4+B18 


59 


111011 


+i+j -l+j+j -l+j+i 


+1-1-1-1-1-1-1+1 
+1+1-1+1+1-1+1+1 


-B4+B23 
B1+B18 


60 


111100 | 


-i-j-j-i-i-j+j+i 


-l+i+i-i-i+i-i+i 
-l-i-i-i-i-i+i+i 


B6+B21 
-B0+B19 


61 


111101 i 


-j+1+1- j-i- j+j+i 


+1+1+1+1-1+1-1+1 
-l+i+i-i-i-i+i+i 


B0+B21 
B6+B19 


62 


111110 


+i+j+j+i-i- j+j+i 


+1-1-1+1-1+1-1+1 
+1+1+1+1-1-1+1+1 


-B6+B21 
B0+B19 


63 


111111 


+j-i-i+j-i-j+j+i 


-i-i-i-i-i+i-i+i 
+1-1-1+1-1-1+1+1 


-B0+B21 
-B6+B19 



XI. Conclusion 

[0155] While various embodiments of the present invention have been described 

above, it should be understood that they have been presented by way of example 
only, and not limitation. Other embodiments are possible and are covered by the 
invention. 



